Trainability and Accuracy of Artificial Neural Networks: An Interacting Particle System Approach
نویسندگان
چکیده
Neural networks, a central tool in machine learning, have demonstrated remarkable, high fidelity performance on image recognition and classification tasks. These successes evince an ability to accurately represent dimensional functions, but rigorous results about the approximation error of neural networks after training are few. Here we establish conditions for global convergence standard optimization algorithm used learning applications, stochastic gradient descent (SGD), quantify scaling its with size network. This is done by reinterpreting SGD as evolution particle system interactions governed potential related objective or "loss" function train We show that, when number $n$ units large, empirical distribution particles descends convex landscape towards minimum at rate independent $n$, resulting that universally scales $O(n^{-1})$. properties established form Law Large Numbers Central Limit Theorem distribution. Our analysis also quantifies scale nature noise introduced provides guidelines step batch use illustrate our findings examples which learn energy continuous 3-spin model sphere. The predicts dimension $d=25$.
منابع مشابه
Trainability in Recurrent Neural Networks
Two potential bottlenecks on the expressiveness of recurrent neural networks (RNNs) are their ability to store information about the task in their parameters, and to store information about the input history in their units. We show experimentally that all common RNN architectures achieve nearly the same per-task and per-unit capacity bounds with careful training, for a variety of tasks and stac...
متن کاملAn Approach of Artificial Neural Networks Modeling Based on Fuzzy Regression for Forecasting Purposes
In this paper, a new approach of modeling for Artificial Neural Networks (ANNs) models based on the concepts of fuzzy regression is proposed. For this purpose, we reformulated ANN model as a fuzzy nonlinear regression model while it has advantages of both fuzzy regression and ANN models. Hence, it can be applied to uncertain, ambiguous, or complex environments due to its flexibility for forecas...
متن کاملPrediction the Return Fluctuations with Artificial Neural Networks' Approach
Time changes of return, inefficiency studies performed and presence of effective factors on share return rate are caused development modern and intelligent methods in estimation and evaluation of share return in stock companies. Aim of this research is prediction of return using financial variables with artificial neural network approach. Therefore, the statistical population of this study incl...
متن کاملCapacity and Trainability in Recurrent Neural Networks
Two potential bottlenecks on the expressiveness of recurrent neural networks (RNNs) are their ability to store information about the task in their parameters, and to store information about the input history in their units. We show experimentally that all common RNN architectures achieve nearly the same per-task and per-unit capacity bounds with careful training, for a variety of tasks and stac...
متن کاملscour modeling piles of kambuzia industrial city bridge using hec-ras and artificial neural network
today, scouring is one of the important topics in the river and coastal engineering so that the most destruction in the bridges is occurred due to this phenomenon. whereas the bridges are assumed as the most important connecting structures in the communications roads in the country and their importance is doubled while floodwater, thus exact design and maintenance thereof is very crucial. f...
ذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Communications on Pure and Applied Mathematics
سال: 2022
ISSN: ['1097-0312', '0010-3640']
DOI: https://doi.org/10.1002/cpa.22074